Saliency-based multifoveated MPEG compression
نویسندگان
چکیده
Most current foveation strategies are limited to foveating sequences based on a direct measurement or an implicit assumption of the gaze direction. Such approaches often fail in unconstrained environments or when necessary equipment is absent. Alternatively, a computational model of visual attention may be used to predict visually salient locations. We describe such a neurobiological model of attention and its specific application to foveated video compression. The algorithm is demonstrated to be successful in foveating to Regions Of human Interest in a variety of video segments, including synthetic as well as natural scenes, and also gives good compression ratios.
منابع مشابه
Selective H.264 Video Coding Based on a Saliency Map
The demand in modern multimedia data transmission is continually increasing. New compression standard, such as the recent H.264/MPEG-4 AVC video coding standard, drastically improves the compression ratio. This higher compression ratio is required because the amount of multimedia data to transmit increases and that the perceived quality expected by the end user is not lessened. A complementary ...
متن کاملAn Embedded Saliency Map Estimator Scheme: Application to Video Encoding
In this paper we propose a novel saliency-based computational model for visual attention. This model processes both top-down (goal directed) and bottom-up information. Processing in the top-down channel creates the so called skin conspicuity map and emulates the visual search for human faces performed by humans. This is clearly a goal directed task but is generic enough to be context independen...
متن کاملNo-Reference Video quality assessment of H.264 video streams based on semantic saliency maps
The paper contributes to No-Reference video quality assessment of broadcasted HD video over IP networks and DVB. In this work we have enhanced our bottom-up spatio-temporal saliency map model by considering semantics of the visual scene. Thus we propose a new saliency map model based on face detection that we called semantic saliency map. A new fusion method has been proposed to merge the botto...
متن کاملA New Unequal Error Protection Technique Based on the Mutual Information of the MPEG-4 Video Frames over Wireless Networks
The performance of video transmission over wireless channels is limited by the channel noise. Thus many error resilience tools have been incorporated into the MPEG-4 video compression method. In addition to these tools, the unequal error protection (UEP) technique has been proposed to protect the different parts in an MPEG-4 video packet with different channel coding rates based on the rate...
متن کامل–high-fidelity Imaging– the Computational Models of the Human Visual System in High Dynamic Range Video Compression, Visible Difference Prediction and Image Processing Dissertation
As new displays and cameras offer enhanced color capabilities, there is a need to extend the precision of digital content. High Dynamic Range (HDR) imaging encodes images and video with higher than normal bit-depth precision, enabling representation of the complete color gamut and the full visible range of luminance. This thesis addresses three problems of HDR imaging: the measurement of visibl...
متن کامل